Allele-specific expression analysis methods for high-density SNP microarray data

نویسندگان

  • Ruijie Liu
  • Ana-Teresa Maia
  • Roslin Russell
  • Carlos Caldas
  • Bruce A. Ponder
  • Matthew E. Ritchie
چکیده

MOTIVATION In the past decade, a number of technologies to quantify allele-specific expression (ASE) in a genome-wide manner have become available to researchers. We investigate the application of single-nucleotide polymorphism (SNP) microarrays to this task, exploring data obtained from both cell lines and primary tissue for which both RNA and DNA profiles are available. RESULTS We analyze data from two experiments that make use of high-density Illumina Infinium II genotyping arrays to measure ASE. We first preprocess each data set, which involves removal of outlier samples, careful normalization and a two-step filtering procedure to remove SNPs that show no evidence of expression in the samples being analyzed and calls that are clear genotyping errors. We then compare three different tests for detecting ASE, one of which has been previously published and two novel approaches. These tests vary at the level at which they operate (per SNP per individual or per SNP) and in the input data they require. Using SNPs from imprinted genes as true positives for ASE, we observe varying sensitivity for the different testing procedures that improves with increasing sample size. Methods that rely on RNA signal alone were found to perform best across a range of metrics. The top ranked SNPs recovered by all methods appear to be reasonable candidates for ASE. AVAILABILITY AND IMPLEMENTATION Analysis was carried out in R (http://www.R-project.org/) using existing functions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Npgrj_nmeth_1194 307..309

We describe a high-throughput method, named ChIP-SNP, for the identification of allele-specific protein-DNA interactions throughout the human genome. ChIP-SNP combines chromatin immunoprecipitation (ChIP) with whole-genome single nucleotide polymorphism (SNP) genotyping microarray analysis. We demonstrated that it can be used to accurately identify allele-specific binding of RNA polymerase II (...

متن کامل

Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine

We can reach by DNA microarray gene expression to such wealth of information with thousands of variables (genes). Analysis of this information can show genetic reasons of disease and tumor differences. In this study we try to reduce high-dimensional data by statistical method to select valuable genes with high impact as biomarkers and then classify ovarian tumor based on gene expression data of...

متن کامل

Gene Identification from Microarray Data for Diagnosis of Acute Myeloid and Lymphoblastic Leukemia Using a Sparse Gene Selection Method

Background: Microarray experiments can simultaneously determine the expression of thousands of genes. Identification of potential genes from microarray data for diagnosis of cancer is important. This study aimed to identify genes for the diagnosis of acute myeloid and lymphoblastic leukemia using a sparse feature selection method. Materials and Methods: In this descriptive study, the expressio...

متن کامل

The miR526b-5p-Related Single Nucleotide Polymorphisms, rs72618599, Located in 3\'-UTR of TCF3 Gene, is Associated with the Risk of Breast and Gastric Cancers

Introduction: Single nucleotide polymorphisms result in dysregulation of the proto-oncogene TCF3 gene, which is associated with the development, metastasis, and chemoresistance of different malignancies. Methods: GSE10810 microarray dataset and GEPIA2 online software were used to find differentially expressed genes and the TCF3 status in breast cancer (BC) and gastric cancer (GC), respectively....

متن کامل

A multi-array multi-SNP genotyping algorithm for Affymetrix SNP microarrays

MOTIVATION Modern strategies for mapping disease loci require efficient genotyping of a large number of known polymorphic sites in the genome. The sensitive and high-throughput nature of hybridization-based DNA microarray technology provides an ideal platform for such an application by interrogating up to hundreds of thousands of single nucleotide polymorphisms (SNPs) in a single assay. Similar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 28 8  شماره 

صفحات  -

تاریخ انتشار 2012